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AN ELEMENTARY PROOF OF THE A 2 BOUND 


MICHAEL T. LACEY 


Abstract. A martingale transform T, applied to an integrable locally supported func¬ 
tion f, is pointwise dominated by a positive sparse operator applied to |f|, the choice of 
sparse operator being a function of T and f. As a corollary, one derives the sharp A p 
bounds for martingale transforms, recently proved by Thiele-Treil-Volberg, as well as a 
number of new sharp weighted inequalities for martingale transforms. The (very easy) 
method of proof (a) only depends upon the weak-L * 1 norm of maximal truncations of 
martingale transforms, (b) applies in the vector valued setting, and (c) has an extension 
to the continuous case, giving a new elementary proof of the A 2 bounds in that setting. 


1. Introduction 

Our subject is weighted inequalities in Harmonic Analysis, which started in the 1970's, 
and has been quite active in recent years. A succinct history begins in 1973, when Hunt- 
Muckenhoupt-Wheeden [HMW] proved that the Hilbert transform H was bounded on 
L 2 (w), for non-negative weight w, if and only if the weight satisfied the Muckenhoupt 
A 2 condition [M2], One could have asked, even then, what the sharp dependence on the 
norm of H is, in terms of the A 2 characteristic of the weight. This was not resolved until 
2007, by Petermichl [P], showing that norm of the Hilbert transform is linear in the A 2 
characteristic. Five years later, among competing approaches, Hytonen [HI] established 
the A 2 Theorem: For any Calderon-Zygmund operator T, and any A 2 weight, the norm 
estimate is linear in A 2 characteristic. Hytonen’s original method depended upon the 
Hytonen Representation Theorem, [HI, Thm. 4.2], a novel expansion of T into a rapidly 
convergent series of discrete dyadic operators. Subsequently, Lerner [L4] gave an alternate 
proof, cleverly exploiting his local mean oscillation inequality [L2] to prove a remarkable 
statement: The norm of Calderon-Zygmund operators on a Banach lattice is dominated by 
the operator norm of a class of very simple positive sparse operators S|f|. See [L4, (1.2)]. 
The A 2 bound is very easy to establish for the sparse operators. For more details on the 
history, see [H2, L4], 

The purpose of this note is to establish the pointwise control of Tf by a sparse operator 
in a direct elementary fashion. For each T and suitable f, there is a choice of sparse 
operator S so that |Tf| < S|f|. Moreover, the recursive proof only requires elementary 
facts about dyadic grids, and the weak integrability of the maximal truncations of T. (That 
is, we do not need Lerner’s mean oscillation inequality.) Before this argument, pointwise 
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bounds were independently established by [CAR, Ll\l], This argument, in the continuous 
case, requires a bit more on the operators than our result. 

The argument is especially transparent in the setting of martingale transforms §2. The 
main result, Theorem 2.4, proved in §3, is new, and cannot be proved by Lerner’s mean 
oscillation inequality. As corollaries, we deduce the A 2 bound for martingale transforms 
recently established by Thiele-Treil-Volberg [TTV], who employ more sophisticated tech¬ 
niques. A number of new, sharp weighted inequalities for martingales also follow as 
corollaries. The argument can be formulated in the Euclidean setting, giving a new self- 
contained proof of the A 2 Theorem in that setting. Remarks and complements conclude 
the paper. 

Acknowledgment. Guillermo Rey kindly pointed out some relevant references, as did the 
referee, who also pointed out some additional obscurities. 


2. Martingale Transforms 


We introduce standard notation required for a discrete time martingale. Let (£2, A, p.) 
be a cr-finite measure space, and let {Q n : n 6 Z} be collections of measurable sets 
Q n Ci such that 

(1) Each Q n is a partition of jQ, and 0 < p.(Q) < 00 for all Q e Q n . 

(2) For each n, and Q € Q n , the set Q is the union of sets Q' e Q n + 1 - (And Q' is 
allowed to equal Q.) 

(3) The collection Q := U n ez Qn generates A. 

The set Q is a tree with respect to inclusion. (And the branching of the tree is unbounded 
in general.) Set Q Q to be the minimal element P 6 Q which strictly contains Q. In words, 
Q Q is the parent of Q. Denote by ch(P) those Q e Q such that Q a = P, that is the 
children of P. 

The conditional expectation associated with Q n is 


E(f|QJ:= X M-(Q ) -1 

Q^Qn 


f dp • 1 Q . 

Q 


Below, we will frequently use the abbreviation (f)qj := p(Q) 1 Jq f dp. The martingale 
difference associated with P e Q is 


Apf lQ((f)q — (f)p). 

QGch(P) 


One should note that P 6 Q n is allowed to be in Q n+ 1 , so that the difference above is 
equal to (E(fl P | Q n +i) — (f)p)lp only if P 6 Q n , but not Q n +i- 

There are two kinds of operators we consider. The first are martingale transforms, 
namely operators of the form 


(2.1) Tf = ^ epApf, | ep| < 1. 
pgq 
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By the joint orthogonality of the martingale differences, T is an L 2 -bounded operator. 
And the second category are sparse operators. An operator S is sparse if 

(2.2) Sf = £<f>pl P 

pgs 

where the collection S satisfies this sparseness condition: For each P G S, 

(2.3) Y. M-(Q) < ^(P). 

Qech 5 (P) 

The sum on the left is restricted to the 5-children of P: The maximal elements of S that 
are strictly contained in P. It is well-known that both classes of operators are bounded in 
L p , with constants uniform over the class of operators. 

One of the main results is this extension of the Lerner inequality [L4] to the martingale 
setting, with a very simple proof, see §3. 

Theorem 2.4. There is a constant C > 0 so that for all functions f G L 1 (p.) supported 
on Qo G Q, and martingale transforms T, there is a sparse operator S = Sjj so that 

(2.5) l Qo |Tf| < C ■ S|f|. 

The same inequality holds for the maximal truncations, Tjj, as defined in (3.2). 


From this Theorem, one can deduce a range of sharp inequalities for martingale trans¬ 
forms by simply repeating the proofs for sparse operators on Euclidean space, using for 
instance the arguments in [L3, §2], 

Consider two weights w, cr. For pair, we have the joint A p condition, 1 < p < oo, 
given by 


[c,w] Ap 


:= sup(cr)^ 1 (w) Q 
QeS 


sup 

n 


O- | Qr, 


IP-1 


E(W | Q n )||c 


If w is non-negative p-almost everywhere, and is also a weight (that is, w 1 ^ 1 ^ 

is locally integrable), then w]a p = [w] Ap is the A p characteristic of w. 

This is the Az Theorem in the martingale setting, as proved by [TTV], After the 
domination by sparse operators, one can follow the short and self-contained proof of 
[Ml, Thm 3.1], 


Theorem 2.6. For any martingale transform or sparse operators T, for any 1 < p < oo, 
and A p weights w, 

||T : L p (w) i— } L p (w)|| < [w]™ n{1,1/(p ~ 1)} . 

One draws the corollary that the martingale differences Aq are unconditionally conver¬ 
gent in L p (w), for all 1 < p < oo, and all A p weights w, which fact has been known 
since the 1970’s. We return to this topic in §5. 

Many finer results are also corollaries, and the reader can use [L3, §2], and [HL, LI, 
NRTV, L4, TV1] as guides to these results. 
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One further application is to certain martingale paraproduct operators. The Theorem 
below follows from an easy modification of the proof that follows, but we suppress the 
details. Define an operator TTf := ^q s q("0q ' t>Q> where {bq : Q 6 Q} are a sequence 
of functions which satisfy Aqbq = bq, and this normalization condition. 

sup M-(P) 1 Y- ll b Qll~M-(Q) < 1- 

P Q:QCP 

It is an easy consequence of the Carleson embedding theorem that FT is an L 2 -bounded 
operator. They are considered in [TV1, Thm 2.5], and they too can be dominated by a 
sparse operator. 

Theorem 2.7. Let f 6 L 1 (Q) be supported on Q 0 G Q. Then, there is a sparse operator 
S such that lq 0 |TTf| < Sf. 

A number of weighted inequalities for these operators is an immediate consequence, 
including both of the main results of [TV1]. 

3. Domination by Sparse Operators: Martingales 

We give the proof of Theorem 2.4, with the basic fact being the weak-L 1 inequality for 
maximal truncations of martingale transforms. 

Theorem 3.1. [Burkholder [B, Thm 6]/ For any martingale transform T we have 
supA|x(T(jf > A) < ||f|| L i M , 

A>0 

where Tf is the maximal truncation, given by 

(3.2) Tjjf := sup Y e Q A Q f • 

QDQ' 

Proof of Theorem 2.4. We concentrate on the case of T being a martingale transform 
as in (2.1), with the changes to accommodate the maximal truncations being easy to 
provide. 

Apply the weak L 1 inequality for the maximal function Mf, and the maximal truncations 
of the martingale transform. Thus, there is a constant Co so large that the set 

E := {max{Mf, T # f} > ^C 0 <|f|> Qo } 

satisfies p(E) p(Qo). Let S be the collection of maximal elements of Q contained in 
E. We claim that 

(3.3) |Tf(x)|l Qo < C 0 (|f|) Qo + 2jT P f(x)|, 

Pe£ 

where Tpf := epa(f)plp ^ CqAqf. 

Q : QcP 
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Here, P a is the parent of P. It is clear that we will add Q 0 to S, the sparse collection 
defining S. And, one should recurse on the T P f, at which stage the collection £ becomes 
the 5-children of Qo. Continuing the recursion will complete the proof. 

If x G Qo \ E, certainly (3.3) is true. For x 6 E then there is a unique P G £ with 
x G P, for which we can write 

(3.4) Tf(x) = ^ e Q A Q f(x) — e P a(f) P a + Tpf(x). 

Q:P a SQ 

Note that the martingale difference A P af(x) is split between the second and third terms 
above. The first two terms on the right in (3.4) are, by construction, bounded by 
4-Co(|f |)q 0 - Thus (3.3) follows. 

□ 


4. Domination by Sparse Operators: Euclidean Case 


We extend the martingale proof of (2.5) to the Euclidean setting, yielding an inequality 
that applies to (a) the most general class of Calderon-Zygmund operators, and (b) in the 
vector valued setting. 

Let T be an L 2 (R d ) bounded operator, of norm 1, and with kernel K(x,y). Namely for 
all f, g G Co(K. d ), whose closed supports that do not intersect, we have 


(Tf, g> 


K(x,y)f(y)g(x) dxdy. 


Let co : [0, oo) i—> [0,1) be a modulus of continuity, that is a monotone increasing and 
subadditive function. The operator T is said to be a Calderon-Zygmund operator if the 
kernel K(x, y) satisfies 


|K(x,y) — K(x',y)| < cu(^ 

and the dual condition, with the roles of x and y reversed, holds. The classical condition 
to place on cu is the Dini condition 


x — x'|\ 1 

|x-y| / l*-y| d ’ 


(4.1) 


rl ( \ dt 

cu ft j — < oo. 

0 t 


The Theorem below is a stronger version of Lerner’s inequality [L4], and its extension in 
[CAR, Corollary Al]. Moreover, it only assumes the Dini condition, while prior approaches 
[HLP,L4,CAR] require 1/t in the Dini integral be replaced by (log2/t)/t. 

Theorem 4.2. Let T be a Calderon-Zygmund operator for which the modulus of con¬ 
tinuity satisfies the Dini condition (4.1). Then, for any compactly supported function 
f G L^Kf 1 ), we have |Tf| < S|f|. where S = S(f, T) is a sum of at most 3 d operators, 
each of which are sparse relative to a choice of dyadic grid on R d . The same conclusion 
holds for the maximal truncations T : , as defined in (4.4). 
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To explain the conclusion of the Theorem, we say that V is a dyadic grid of R d , if V 
is a collection of cubes Q C l d so that for each integer k 6 Z, the cubes := {Q G 
V : |Q| = 2~ kd } partition R d , and these collections form an increasing filtration on R d . 
Then, a sparse operator is one that is sparse in the sense of (2.3), relative to a dyadic 
grid and Lebesgue measure. 

To prove the Theorem, we use the standard fact that the maximal truncations of T 
satisfy a weak-L 1 inequality, see [S, §4.2], 

Lemma 4.3. For a Calderon-Zygmund operator T with Dini modulus of continuity, we 
have A|{T(jf > A}| < ||f||i, for all A > 0, where T# are the maximal truncations 


(4.4) 


Tjjf(x) := sup 

6>0 


K(x,y)f(y) dy 

- l*-ul>5 


There are a finite number of choices of grids which well-approximate any cube in R d . 
This old observation has a proof in [HLP, Lemma 2.5]. 

Lemma 4.5. There are choices of dyadic grids D u , for 1 < u < 3 d , so that for any cube 
P C R d , there is a choice of 1 < u < 3 d , and dyadic Q G V u such that P C Q, and 
£Q < 6£P, where fP = |P1 1 4 d is the side length of P. 


The grids of the Lemma are defined by appropriate shifts of a dyadic grid. Abusing 
common terminology, we say that Q is a dyadic cube if it is in any of the collections V u , 
1 < u < 3 d . And, we will write Q G 2\(q). 

Since the proof of Theorem 4.2 is recursive, we need to adapt the definitions of maximal 
functions and maximal truncations to a cube P. Set, for x G P, 


Mpf(x) := sup A t f(x), 

0<t<dist(x,9P) 


A t f(x) 


|f(x-y)|t|>(y/t) dy/t d . 


Here and below, \Jj> is a smooth function with l[_i/2,i/2]<i < 4 > < Likewise, the 

maximal truncations are adapted to a particular cube P: For x G P, define 


T(t,pf(x) := sup |T S)t f(x)|, xgP, 

0<s<t< p dist(x,9P) 


where T S)t f(x) 


K(x,y)f(y)[tK(x-y)/t) —iK(x-y)/s)] dy. 


The truncation levels are taken to stay inside P, as we measure the distance with respect 
to the £°° norm on R d . For x i P, set M P f(x) = Tjj P f(x) = 0. It follows that this 
definition is monotone, an important fact for us: For P C Q 

(4-6) T(j iP f = Tjj )P (fl P ) < T^qf. 

Both M P and Tj >P still satisfy the weak L’ inequality. 

The main step in the recursion is in this. 
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Lemma 4.7. There is a finite constant C so that for all integrable functions f supported 
on a cube P, there is a collection V[V) of dyadic cubes Q C P, so that: 

(1) (The cubes have small measure.) There holds 

Y_ |Q| < 3 -3d ~ 3 |P|. 

QeP[ P) 

(2) (The cubes are approximately disjoint.) If Q' C Q, and Q', Q G V if?) , then 

Q' = Q- 

(3) (Tf is controlled.) Pointwise, there holds 
(4.8) T(|,pf < C(|f|) P + max T tt(Q f. 


The last point is an extension of (3.3). 

Proof. For a large constant C to be determined, set 

E(C) := {x G P : max{M. P f(x),Tjj )P f(x)} > C(|f|) P }. 

It follows from Lemma 4.3, that |E(C)| < C _1 |P|. For x G E(C), let 0 < ff x < dist(x, 3P) 
be the largest choice of 0 < s < dist(x, 3Qo) such that either A s f(x) = C(|f|)p, or there 
is a s < T x < rjE-dist(x, 3Q 0 ) with |T SjTx f(x)| = C(|f|)p. 

We have this (standard) lower semi-continuity result: There is an absolute 0 < p < 1 
so that 


(4.9) max{|T ffX(Tx f(x , )|, A ffx f(x')} > £C(|f|) P , |x-x'| < pcr x . 

For the averaging operator, if A ffx f(x) > C(|f|)p, this is very easy, and so we proceed 
under the assumption that A s f(x) < C(|f|)p for all cr x < s < dist(x, 9Qo). It is then 
the truncated singular integral that is large, and we have to rely upon the Dini condition: 
Abbreviating a familiar calculation, and assuming that t < cu(t), for 0 < t < 1, as we 
can do, assuming |x — x'| < pcr x , 


|T ffx ,T X f(x') -T ffX)Tx f(x)| < 


cu 


cr x /2<|x—y|<T x 

'OO 

CU 

Cx/2 
r 2p 

cu(s 

0 


/ p°- x \ 


Vx-y/ 

|x-y| 


dy 


^(|f|)p 

^(1% 


(<T 

s ds 
s 


dt 

t 


Thus, under the Dini condition (4.1), we see that for p = p(4>, cu) sufficiently small, (4.9) 
holds. (In particular, p is independent of C.) 

There is a second upper semi-continuity result. Set B x := x + cr x [—1 /2,1/2] d C P. For 
any y G B x , and any ^cr x < s < t < ^dist(x, 3P), 


(4.10) 


K(y,z)f(z)[tp(y -z/t) -iM(y -z)/s)] dz 


^C(|f|) P . 
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That is, on B x , very large values of Tjj iP f(y) cannot arise from fl(2B x ) c < since otherwise 
we violate the definition of cr x . 

We have just seen that pB x C E(C/2). Thus, 

|E 0 |<p- d C- 1 |P|, E 0 := U B *- 

xeE(C) 


And, ify G P\E 0 , then |T # , P f(y)| < C(|f|) P . 

The path from the cubes B x to dyadic cubes uses Lemma 4.5. To each cube B x , 
associate a choice of dyadic cube Q x D B x with £Q X < 6cr x . Since 12cr x < dist(x, 3P), it 
follows that Q x C P. Then, we require 

I E-| j < 3 3d ^ 3 1P|, E! := U Q x , 

xGE 0 

which is true for large enough C, thus specifying this constant. 

The collection V{V) claimed in the Lemma are the maximal cubes in {Q y : y G E 0 }. 
Properties (1) and (2) are immediate, and it remains to check (3), namely to bound 
Tfl.pfCy)- If y G P\Eo, then T t j ) pf(y) < C(|f|) P . And, otherwise, y G B y C Q, for some 
Q G 'P(P). But it follows from (4.10) and monotonicity again that we have 

T(t,pf(y) < C'(|f|) P + Tjj )By f(y) < C'(|f|) P + Tjj iQx f(y). 

Here, C' is the implicit constant in (4.10). And, this completes our proof. □ 


Proof of Theorem 4.2. Take a compactly supported function f G L^RQ, and a dyadic 
cube P G D U (P) large enough that (Tjjf)lp < T^pf + C(|f|)p. (The term (|f|)p is added 
to address boundary effects in the definition of T^pf.) The standard kernel estimate then 
implies that off of P, there holds 


T # f(x)lpc(x) < 


llflh 

[£P + dist(x, P)] d " 


Apply Lemma 4.5 to the cubes 2 n P, n > 1, to select dyadic P n meeting the conclusion 
of the Lemma. Add P to <S u (p), and P n to 5 u ( Pn ), where C V u is the sparse collection 
of dyadic cubes that defines S u . Then, it follows that 


3 d 

T(tf(x) < ^S u |f|(x), x*P. 

U=1 


It remains to dominate Tjj )P f. Apply Lemma 4.7, so that in particular (4.8) holds. We 
will rewrite (4.8) in the form below, to set up the recursion. 

3d 



Above, all of the sparse operators are zero, except for S d (P ), and the associated sparse 
collection is <S® (P ) = {P}. We will refer to the cubes in V[V) — To as JJ -descendants of P. 
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The inductive step is to apply Lemma 4.7 to those Q G Vo with maximal side lengths. 
The details are as follows. Suppose that we have this estimate below for some integer 
t > 0. 



where these properties hold: 

TO: is a operator defined by collection C T> Ul for 1 < u < 3 d , as in the 

equation (2.2). (The operator will be sparse, but we do not assert that at this 
point.) 

Tl: All cubes Q G Vt are dyadic. If Q' C Q for any two Q, Q' G V t , then Q' = Q. 

Each Q G V t is a {{-descendant of some cube S G Uu-i «Su- 
T2: If Q fl S =£ 0, where Q G Vt and S G S l := Uuii then fQ < ^S. 

Then, the recursive step is as follows. Let VI be those Q G V t such that fQ is maximal 
among all side lengths of the cubes {Q : Q G V t }. Apply Lemma 4.7 to each Q G V*. 
We have 


T tt, Q (f) < c <l f l>Q + max. T #iQ ,(f), 


Q'6 VIQ) 


where the cubes V{Q) satisfy the conclusions of Lemma 4.7. 
It follows from (4.11) that pointwise, 


3 d 

Vf<c£st|f| 

U=1 


+ max 


max Tjj n(f), max{C(|f|) Q l Q + 
Q£Vt\V* Qev*'- l/v v 


max 

Q'eP(Q) 




In the last line, the new operator S^ +1 is obtained from 5^ by adding to <5^ those cubes 
{Q G V* : u(Q) = u}. By monotonicity (4.6), we take the collection P t+ i to be those 
cubes in 


(n\n*)uU^Q) 

V* 

that are maximal, with respect to inclusion. 

After the recursion finishes, it only remains to show that the limiting operators S u are 
sparse. Take S G <S U , and fix the smallest integer t such that S G 5*. There are three 
sources of «S u -chiIdren for S, either directly, or through further {{-descendants: 

Type A: Cubes in Q G V[S'), for some S' G with fS' > fS. 

Type B: Cubes in Q G V{S'), for some S' G S t with £S' < £S. 
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Cubes of Type A cannot occur: Since £S' > £S, the smallest integer t' such that 
S' G 5*, is strictly smaller than t. Let Q G P(S') be contained in S. If at any time in 
the recursion from t' < s < t, both cubes Q and S were potential members of V s , then 
Q would have been eliminated by monotonicity, namely using (4.6). 

Concerning cubes of Type B, fix an integer n, and let B n be those S' G UjLi •SJ' °f s '^ e 
length 2~ n £S that intersect S, but are not contained in it. These are necessarily contained 
in (1 +2~ n )S \ (1 — 2^" n S), for dimensional constant c. And, they can overlap at most 
3 d times. The total measure of all ^-descendants of each S' is at most 3~ 3d ~ 3 |S'|, so the 
total measure of potential children from all of these cubes is at most 

oo oo 

s -3d-3^- Y_ |s'| < 3~ 2d_3 1Sj ^(1 + 2~ n ) d — (1 — 2^ n ) d 

n=0 S'G^n n=0 

oo 

< 3~ d ^ 3 Y_ 2 ^ n+1 1S | < 3 d 1 1SI . 

n=0 

Therefore, the collections are sparse, and the proof of the Theorem is complete. 

□ 


5. Complements 

In late 1970’s, several authors considered martingale analogs of the A p theory. For 
instance, lzumisawa-Kazamaki[IK], proved a variant of the Muckenhoupt maximal function 
result [M2] in this setting. When it came to martingale transforms, the distinction between 
the homogeneous and non-homogeneous cases was already recognized by these authors. 
Nevertheless, norm inequalities for martingale transforms were proved by Bonami-Lepingle 
[BL, Th. 1] in 1978. 1 Their proof follows the good-A approach, which does not yield sharp 
constants. 

The Lerner mean oscillation inequality [L2] gives a pointwise bound on an arbitrary 
measurable function c(o by a sum over a sparse dyadic collection of cubes, Q G S. Applying 
this inequality Calderon-Zygmund operators T applied to L 1 function f, leads a number 
of ‘tail issues’, see [H LP , L4] . 

Lerner-Nazarov and Conde-Rey [CAR,LN] provide pointwise domination of Tf by sparse 
operators. Their proofs work on dyadic operators, and ‘lose a log’ in passage to the 
continuous case. 

The proof just described recursively selects advantageous sparse cubes, requiring only 
the weak-L 1 inequality for maximal truncations, and basic facts about dyadic grids. Thus, 
the technique works with only modest changes for UMD valued functions. (In contrast, 
the mean oscillation inequality requires a concept of ‘median’, see [HH].) 

1 These references do not seem to be as well known as they should be. Beyond the setting of discrete 

martingales of this paper, some of these references consider continuous time martingales, with cadalag 

sample paths. 
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By an observation of Treil-Vol'berg [TV2], the argument in § 4 can be adapted to 
the non-homogeneous Calderon-Zygmund theory. In particular, a variant of Theorem 4.2 
holds in this setting, and it implies this result, stated in the language of [OP, NTV], 


Theorem 5.1. [Treil, Volberg [TV2]/ Let (X, d) be a geometrically doubling space, Let 
(j. be a measure on X which is of order m, that is p.(B(x, r)) < r m , where B(x, r) := {y : 
d(x, ij) < r}. Let T be an order m L 2 -bounded Calderon-Zygmund operator on (X, p.). 
For any locally finite weight w on X, positive almost everywhere, such that cr := w 1 is 
also a weight, there holds 


T : L 2 (w) i— > L 2 (tv)|| < sup 

x,r 


w(B(x,r)) cr(B(x,r)) 
p.(B(x,3r)) p.(B(x,r))’ 


A third application of the technique of this paper has lead to pointwise bounds for op¬ 
erators beyond the Calderon-Zygmund scale, an interesting new development by Bernicot- 
Frey-Petermichl [BFP] . 

Further applications to the multi-linear Calderon-Zygmund theory [LMS, DLP] should 
also be possible. In particular, [CAR, Corollary A.l] dominates a multi-linear Calderon- 
Zygmund operator by a sparse operator, and from there one deduces weighted inequalities. 
The modulus of continuity of the operator is assumed to satisfy a logarithmic Dini condi¬ 
tion, which can presumably be relaxed to just a Dini condition. 
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